Modeling Visual Compatibility through Hierarchical Mid-level Elements

نویسندگان

  • José Oramas M.
  • Tinne Tuytelaars
چکیده

In this paper we present a hierarchical method to discover mid-level elements with the objective of modeling visual compatibility between objects. At the base-level, our method identifies patterns of CNN activations with the aim of modeling different variations/styles in which objects of the classes of interest may occur. At the top-level, the proposed method discovers patterns of co-occurring activations of baselevel elements that define visual compatibility between pairs of object classes. Experiments on the massive Amazon dataset show the strength of our method at describing object classes and the characteristics that drive the compatibility between them.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hierarchical Implicit Shape Modeling

In this paper, a new hierarchical approach for part-based object recognition is proposed. Object detection methods based on Implicit Shape Model (ISM) efficiently handle deformable objects, occlusions and clutters. The structure of each object in ISM is defined by a spring like graph, hence parts independently vote to object properties. We introduce hierarchical ISM in which structure of each o...

متن کامل

Modeling Mid-level Visual Representations through Clustering in a Convolutional Neural Network

The nature of visual properties used in cortical perception is subject to considerable ongoing study. Features of intermediate complexity are particularly uncertain. Convolutional Neural Network (CNN) models, however, have proven to be quite effective in modeling human vision (Yamins et al., 2014) and have performed with great accuracy on image classification tasks (Krizhevsky et al., 2012). St...

متن کامل

Learning Discriminative Visual N-grams from Mid-level Image Features

Mid-level image features have been shown to be helpful to bridge the semantic gap between low-level and high-level image representations. Many existing methods to learn mid-level visual elements consider each mid-level feature individually, and do not take their mutual relationships into account. We follow the intuitive idea that learning discriminative combinations of visual elements can help ...

متن کامل

VISDA: an open-source caBIGTM analytical tool for data clustering and beyond

SUMMARY VISDA (Visual Statistical Data Analyzer) is a caBIG analytical tool for cluster modeling, visualization and discovery that has met silver-level compatibility under the caBIG initiative. Being statistically principled and visually interfaced, VISDA exploits both hierarchical statistics modeling and human gift for pattern recognition to allow a progressive yet interactive discovery of hid...

متن کامل

Video (GIF) Sentiment Analysis using Large-Scale Mid-Level Ontology

With faster connection speed, Internet users are now making social network a huge reservoir of texts, images and video clips (GIF). Sentiment analysis for such online platform can be used to predict political elections, evaluates economic indicators and so on. However, GIF sentiment analysis is quite challenging, not only because it hinges on spatio-temporal visual contentabstraction, but also ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1604.00036  شماره 

صفحات  -

تاریخ انتشار 2016